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Abstract — We study the scalability of multihop wireless com- 
munications, a major concern in networking, for the case that 
users access content replicated across the nodes. In contrast 
to the standard paradigm of randomly selected communicating 
pairs, content replication is efficient for certain regimes of file 
popularity, cache and network size. Our study begins with the 
detailed joint content replication and delivery problem, a hard 
combinatorial optimization, which is reduced to the simpler 
replication density problem whose performance is of the same 
order as the original. Assuming a Zipf popularity law, and letting 
the number of files and network nodes both go to infinity, we 

identify scaling capacity laws from 0\y/~Nj down to 0(1). 

I. Introduction 

A novel technology, key for the future Internet, is content- 
based networking [1]: in this paradigm, data requests are 
placed on content, as opposed to network address, and routes 
are formed based on content provision and user interest. 
Content caching is a well-known technique that exploits the 
temporal as well as spatial vicinity of user requests to improve 
the user-perceived Quality of Service (QoS) and network per- 
formance metrics. Its merits have been widely demonstrated 
in various networking paradigms, such as Content Delivery 
Networks (CDNs), Publish-Subscribe and Peer-to-Peer (P2P). 

Another major technology for future networks is wireless 
networking, as it supports the exemplar of ubiquitous access 
for mobile users. Unfortunately, long multihop communica- 
tions are known not to scale [2] (i.e., the maximum common 
rate for all flows is inversely proportional to the average num- 
ber of hops). Considering, moreover, the volatility of wireless 
communications and the associated performance bottlenecks, 
it becomes quite important to assess the performance benefits 
of caching and its effect on the system sustainability. 

Aiming to address the performance and sustainability of 
wireless networks with caching, after reviewing past and 
related work and the asymptotic notation in Sections II and III, 
this work makes the following contributions: 

1) It formulates the optimization of content replication 
jointly with routing for minimizing the required wireless 
link capacity (Section IV), a hard combinatorial problem 
that looks at the detailed delivery routes and the cache 
content at each node; this is step-by-step reduced to 

2) a simple and mathematically tractable problem, whose 
scope spans just to the macroscopic frequency of content 
replication (Section V). An efficient solution is designed, 
and is shown to be order efficient to the optimal solutions 
of both the simplified and the original hard problem. 

3) Using the above solution, in Section VI, we study and 
analyze the asymptotic laws of the required link capacity 



as the network (number of nodes) and content (number 
of files) both scale to infinity. Link capacity is shown 
to scale from O (ViV), the non-sustainable regime for 
the wireless network of [2], down to O(l). The latter 
implies that, provided sufficiently broad wireless links, 
the system performance is unaffected of its size. 
4) The precise conditions relating the content volume vs. 
the network size, the cache size and the file popularity 
distribution are presented for caching to make a differ- 
ence in the system performance and scalability. 
The above are summarized in a discussion in Section VII. 

II. Problem Background & Related Work 

Consider a wireless network of N nodes, where multihop 
communications are used to exchange data: in such a network, 
the maximum throughput per node scales as 0( 1 /v / Jv) [2]. 
This celebrated, albeit pessimistic result essentially states that 
per-user throughput drops to zero as the network expands. 
The source of this decline lies in the assumed uniform traffic 
matrix for the generation of throughput demand: the average 
communicating pair distance rises as (\/~N ^ . 

This result stimulated a series of attempts to overrun it, as in 
[3], where cooperative transmissions are employed to preserve 
the user throughput, still hardly avoiding the 1 /VN law. In [4], 
non-uniform transfer matrices are analyzed, identifying laws 
for various types of flows (e.g., asymmetric, multicast, etc.). 
Even in the case that a multihop wireless network is aided 
by infrastructure, to overcome the 1 /Vn law, quite many base 
stations i7^\/W^ are required [5]. Nonetheless, the limitation 
is shown to be in nature geometrical [6], and, thus, organically 
linked to Maxwell's electromagnetic theory. 

In this work (and [7], an earlier version), we focus on nodes 
generating requests on particular content (i.e., files), as op- 
posed to specific destinations. This is a major shift, as files may 
be cached in multiple nodes in the network; hence, requests 
can be potentially served from multiple locations, close to their 
origin. Moreover, we assume the standard uniform distribution 
about the request origin, along with a power law regarding 
the requested content. The goal is to investigate whether the 
enhancement of caching can turn sustainable a large wireless 
network, i.e., overcome the 0( 1 /v / Jv) law of [2]. 

Caching is a technique well-known to improve performance 
in various domains. In wireless networks, performance benefits 
arise from the hop reduction [8]; this work revisits and 
enhances the basic model of [8] in the direction of deter- 
mining the associated scaling laws. In the Publish-Subscribe 
paradigm, caching is employed to preserve information spatio- 
temporally and shield against link breakages and mobility 
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[9]. In wireless meshes, cooperative caching can improve 
performance by means of implementation [10]. 

An exploration of the benefit of caching in the scaling of 
large networks has been conducted in [11], but on a different 
perspective than ours. First, [11] uses arbitrary traffic matrix. 
Second, the cache contents are input parameters, whereas, 
in this work (and in [8]), replication is a key part of the 
optimization. Last, [11] assumes the paradigm of cooperative 
transmissions of [3], which leads to a delivery strategy differ- 
ent than shortest path routing. In total, the network can become 
approximately sustainable with the use of an hierarchical tree 
structure of transmissions over arbitrarily long links. However, 
it should be stressed that the results of [3], [11] depend on the 
particular signal attenuation parameters assumed. 

Here, we study the square grid, a well known 2D model 
[12] of wireless networks' topology and connectivity, and 
symmetric traffic about its origin. This approach is less general 
than [11], aiming to identify definite asymptotic laws and shred 
light on whether caching can make the system scalable. 

To this end, we consider the Zipf law for the non-uniform 
file requests, as in [8]. Literature provides ample evidence 
[13], [14] that the file popularity in the Internet follows such a 
power law. The Zipf parameter ranges from 0.5 [15] to 3 [16] 
depending on the application: low values are representative in 
routers, intermediate values in proxies and higher values in 
mobile applications [17], [18] — see also references therein. 

III. Asymptotic Notation 

Let us define the notation used in the asymptotic laws that 
follow. Let / and g be real functions. Then, / e o(g) if 

/(*) 



for any k > 0, there exists x Q s. t. for x > x a , 



9{x) 



< k. 



Although o(g) defines a set of functions, it is customary to 
write / = o(g) (slightly abusing notation), instead of / € o(g). 

Moreover, / = O(g), if there exists a k > such that f(x) 
is eventually, in absolute value, less or equal to kg(x), that is 



there exist k > 0, x a > s. t. for x > x a , 



/(*) 



9{x) 



< k. 
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Using such a k, we can write that / < kg and / < k g, if 
/, g are positive functions and k' > k. 

If the inequalities on the above definitions are reversed, i.e., 
\f(x)/g(x)\ > k, we get that / = w(g), or / = Q{g); using 

lim i im 

such a k, / > kg and / > k'g, for positive /, g and kl < k. 

lim lim 

In the case that / < g and / > g, we write that / <~ g. 

Last, / = O (g) if / = f2(g) and / - 0(g). 

An important consequence of the above is that / = 0(g) 

does not imply / < g — e.g., consider f(x) = 2g(x); however, 

the reverse is true. Moreover, if / < g, then g — f = 0(g). 

IV. Basic Definitions and the General Problem 

Assume N peers, with N being a square of an integer, 
indexed by n S JV = {1,2,..., N}, arranged on a square 
grid on the plane of size \/N rows times \/N columns. Each 
node is connected to its four neighbors that lie next to it on 



the same row or column with non-directed links. By keeping 
the node density fixed and increasing the network size N, we 
obtain a scaling network similar to [2]. Moreover, to avoid 
boundary effects, we consider a toroidal structure as in [19]. 

This grid structure explicitly considers discrete nodes, in 
contrast to the average cache capacity density of [8]. Unless 
nodes coordinate transmissions in complex schemes (e.g., [3], 
[11]), it is a reasonable communications scheme: the commu- 
nication range is limited due to attenuation and interference; 
using a frequency reuse factor appropriate to the physical layer 
(or TDMA, or random access at the MAC layer), the network 
layer is abstracted to the lattice [12]. Last, this setup essentially 
matches the random communicating pairs of [2]. 

Nodes (or users located therein) generate requests to access 
files/data, indexed by m e M = {1, 2, . . . , M}. Each node n 
is equipped with a cache/buffer, whose contents are denoted by 
the set B n , a subset of A4. If a request at node n regards a file 
m that lies in £>„, then it is served locally. Due to the limited 
buffer capacity, m will often be not available in B n , thus, node 
n will have to request m over the network from some other 
node w that keeps m in its cache. Let, then, W m C N be the 
set of nodes that maintain m in their caches. 

Let, moreover, K be the storage capacity of nodes' cache 
measured in the number of files it can store. This means that 
all M files are of the same (unit) size, placing a constraint on 
the cardinality of cache contents \B n \ < K. The generalization 
to variable sized files can be still captured in this framework 
by splitting each large file into multiple unit segments, and 
then treating its segments as separate, independent files'. 

For the problem of replication not to be trivial, it should be 
K < M, which implies that each node has to select the files 
to buffer in its cache (later, in the asymptotic laws, we shift 
our attention to if < M). Moreover, for the network to have 
sufficient memory to store each file at least once, it should be 



KN > M. 



(1) 



Last, let each node n € N generate requests for data at a 
rate of A„. Although the definitions of Section IV- A cover the 
general case, we stick to the symmetric node traffic, of uniform 
request rate across nodes, X n = A. Each request regards a 
particular file m € M, depending on the file m's popularity 
p m . In essence, \p m ] is a probability distribution, i.e., sets 
the probability of a request for a given file. Clearly, each file's 
replication should be based on its popularity: e.g., to minimize 
the network traffic, popular files should be stored densely. 

A. General Replication-Routing Problem 

Network traffic is the criterion to decide about the file 
replication, and in particular, the lowest link capacity that 
can sustain the request demand. Defining Cg as the rate of 
traffic carried by link £, the network is stable, not discarding 
requests, only if the capacity of link I exceeds Cg. In the 
primary formulation of the problem, we consider the worst 
link case, i.e., the most loaded link, max£ Cg. Next, we also 
consider the average traffic over the links, avgg Cg. In both 

'The presented framework allows for independent storage of a file's 
segments, dispensing with the need to keep them all in the same cache. 
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Symbol Definition 


Symbol 


Set of 
allowed values 


Set 
Cardinality 


Node 


n 


N 


N = 4" 


Alternate Node notation 


0,?/) 


{1 JV5_i>a 


TV 


File/Data 


m 


M 


M 


Buffer/Cache contents 


On 


C a „ T U Q TT 

iee laoie ll 




Path (with use probability) 


[r;vi, ...,v k ] 






Set of Routes V- n ,m,i 
from node n to file rn 1 


V < v n v ikt,„, m 


} 




Route TZ n ,m.i probability 


n.m 


(0,1] 




Node maintaining 
file m in its cache 


W m 




W m 


Node that serves client 
node ra's requests on m 








Nodes served by node w m 
on their requests of m 


n 




\IJ 1 i 1, 


Hop count from node n 
to the serving node(s) W m 


h(n, m) 


[0,+oo) 




Density of data m 








Canonical density of data m 




i} 


1 + v 


Logarithm of can. density 


v m = - log 4 d° m 


{0,1,. 


1 + u 



Problem: 


Worst Link 
Node Capa- 
city (WN) 


Average Link 
Node Capa- 
city (AN) 


Average Link 
Total Capa- 
city (AT) 


Objective 


min C m 


min C AB 


min C AT 


Cost 
Function 


C WB =maxQ 


C AN = C AT A avgCV 

< 


Optimization 
Variables 


[Bn], PVJ 

.,1 st-> (\ n.m n.m n.m \ 

with TZ„,m = {[r z ;v t { v ik \ nm \j 


Constraints 
on Buffer 
Contents 
[Bn]: 


For all n e M, \B„\<K (2) 


^|e„l<fflv(3) 


For all n e N, B n C M (4) 


U B n =M (5) 


Constraints 
on Routes 


, i - n,m n.m n.m 

For all n, m, % : n.v., , v.r, ' . . . , v.,' 
is a list of adjacent nodes 


For all n,m,i : B™.'™ 9 m (7) 


For all n,m : ^r"' m = 1 (8) 


Opt. Value, 
Arguments 


i /v n,m 


£<AN* 
u n > /v n,m 


C AT * 

re AT* -y->AT* 



TABLE I 

List of symbols used in Section IV. 



TABLE II 

List of Joint Replication/Delivery Problems. 



setups, the replication problem regards finding the cache B n 
at all nodes n, that minimize max Ce or avg^ Ce, respectively. 

The individual node capacity constraint of the primary 
formulation is expressed through \B n \ < K, for all n£^; 
again, we also consider an alternate version of the capacity 
constraint, pertaining to the total capacity over the network: 
J2 n \Bn\ < KN. This corresponds to a relaxed network 
setting where the total buffer capacity NK is to be arbitrarily 
distributed to the nodes. As each file should be stored at least 
once in the network, in all cases, it has to be U ne j^B n = M. 

Allowing for receiver driven anycast strategy, the network 
should choose a node w m , n to serve the requests of client n on 
m; w m ,n should be selected from the possibly many candidates 
of set W TO . Therefore, the replication problem is implicitly 
linked to a joint delivery problem of finding appropriate paths 
of adjacent nodes n, v\, V2, ■ ■ ■ , Vk from each client n to a node 
Vk € W m for each file to, that minimize maxCe or avg £ Cg. 

Since there exist multiple routes between a node n and 
the caches containing m, we should allow splitting the traffic 
among them, to balance the load among links. Thus, the 
delivery problem involves finding a set of routes 

-7p jri-) \ rr n.m n.m n.m n,mn 

'<-n,m — i'<-n,m,i/ — \[ r i \ v n -, v i2 i ■ ■ ■ i v ik II 

from each client node v to a server node v^ m € W TO . On each 
route lZ n , m ,i, r™'™ denotes the portion of requests of node n 
for file to that use path n,v™{ m ,v™ 2 ,m , . . . ,w" fe ' m is used by 
node n. Therefore, it has to be J^i r"' m = 1. 

Given the cache contents [£>„] and routes [7£ n>m ], it is easy 
to sum up the traffic Ci per link I; we do not provide the ex- 
plicit formula, but later we carry out associated computations. 

Based on the above, we define three variants of the 
replication-delivery problem (Table II), beginning with the 
primary (and hardest) one, relaxed next to simpler versions. 



Problem 1 [Worst Link Node Capacity (WN)]: 
Minimize max< Ce([B n ], [7\L„ jTn ]), subject to (2), (4-8). 

Problem 2 [Average Link Node Capacity (AN)]: 
Minimize &vg e Ce({B n } 7 {lZ n ,m]), subject to (2), (4-8). 

Problem 3 [Average Link Total Cache (AT)]: 
Minimize avg e Ci{[B n ], [TZ n ^ m ]), subject to (3), (4-8). 

As shown in Table II, we use the * notation to denote the 
optimal value of the objective function, and a minimizing pair 
of the cache contents and routes (as there may exist multiple 
optimal solutions). Note that in the above definitions, we have 
omitted from the argument list the parameters [p m ], [A„] and 
cache capacity K to contain the notation clutter. 

It is clear that these problems are of combinatorial complex- 
ity, and thus are not amenable to an easy to compute, closed- 
form solution. However, as we are interested in asymptotic 
laws, we will use the simpler structure of the latter problems 
to design straightforward replication strategies, and compute 
approximations that are within a constant to the optimal. 

First, observe that, as the WN and AN problems share the 
same constraints, [B™*] , [TZ™* n ] is valid for AN. Moreover, 
for any [£>„], [7£„ ;m ], it is rnaxi d > avg £ Ce, therefore, 

C™*>avgC,([C*],ra)>^- 

t 

Second, the AT problem has relaxed constraints in compari- 
son to AN, i.e., any {[B n ,1Z n , m ]) of AN satisfies AT, too, and 
both problems share a common objective function. Thus, 

Combining the above, it follows that 
Lemma 1 [WN vs. AN vs. AT]: C m * > C AN * > C AT *. 
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Observe that shortest paths are sufficient in the AN 
and AT problems. Indeed, consider for some n, m a route 

r n,m n.ra n.m n,m i . -t~, .1 . ■ 1 

[r j ]v ix ,v i2 , ■ ■ ■ i v ik inm \ m set ><-n,m that involves 
more hops than path [n, v\, V2, ■ ■ ■ , Vk] with m £ B Vk . Re- 
placing it by [r™' m ; v\, V2, ■ ■ ■ , Vk]} (or, if the shorter path is 
already in lZ n . m , adding the routing probabilities) reduces the 
average link load thanks to the new path's fewer hops: 

Lemma 2 [AN- AT Shortest Path Optimality]: The 
optimal routes TZ^ m ,TZ^ m consist of shortest paths only. 

As there may exist multiple paths from a source n to file m 
with the same hop count; in this case, we are free to arbitrarily 
distribute traffic among them. Let, therefore, h(n, m) denote 
the hop-count of a shortest path between n and m. It is 



^2 c f =Yl £ h(n,m)p m . 



E 

neW m£M 



(9) 



For simplicity, the above assumes a unit request rate per node 
A = 1; it is clear that the link capacities of WN, AN and AT 
are directly linear to the arrival rate A. Then, both sides of (9) 
sum the total load on the network: the LHS expresses the total 
load as the sum over all links, while the RHS expresses it as 
the load generated by each file request of each node. 



Problem 4 [Continuous Density (CD)]: Minimize 



V2 
6 



C CD ([d m }) 4 ^ ]T 

rneM 

with respect to [d m ], subject to: 



1 



1) For any m £ Ai, < d„ 



< 1, 



2) E 



meM u " m 



dm < K. 



In the CD Problem, the optimization variables are the 
densities d m , which express the fraction of caches containing 
file m. In the objective, d m 2 — 1 approximates the average hop 
count from a random node to a cache containing m. Weighted 
by the probability p m of requests on m, the summation 
expresses the average link load per request. 

It is clear that any [£>„] that satisfies the WN, AN or AT 
problems' constraints yields a distribution [d m ] that satisfies 
the constrains of CD. Combining this with Lemma 1, 

Theorem 4 [CD bound]: C cd * < C AT * < C* AN * < C m *. 

The t/jv < d m < 1 are an important difference from the 
model of [8], which, as seen next, affects the solution, and has 
a major impact in the asymptotics. 



V. Density-based Formulations 

The above formulation takes a microscopic view on the 
precise routes and specific cache contents. Now, we switch to a 
macroscopic view, that considers the frequency of occurrence 
of each file in the caches. This leads to an easy-to-solve 
problem, which permits finding the asymptotic link rate. 

A. Replication Density and Distance Approximation 

Focusing on the frequency of occurrence of each file m in 
the caches B n , we define the replication density d m as the 
fraction of nodes that store file m in the network: 



dn 



4l> 



(10) 



Under shortest path routing, the inverse of replication den- 
sity d m may be regarded in a fluid approximation as the 
number of peers served by the node that maintains m in its 
cache; hence, it also represents the size of the area served by 
a specific location as the source of information m. 

Note that in the AT and previous problems, d m takes values 
in the set of {1/N, 2/N, . . . , 1}, and moreover, that 

1 



N E 



BJ < 



K. 



Consider now densities [d k ^*\ defined from the optimal 
cache [B^*] and (10). The following is a very useful result 
(see Appendix A for the proof of all results of this Section) that 
naturally lead us to a new problem based on the file densities: 

Lemma 3 [AT Capacity Lower Bound]: 



1 Pn 




B. CD Problem Solution 

First, let us establish the uniqueness of the solution: 

Lemma 5 [CD Convexity]: CD is a strictly convex opti- 
mization problem, and has a unique optimal solution. 

The computation of the solution is relatively easy, using the 
Karush-Kuhn-Tucker (KTT) conditions. Except for the trivial 
case of M > KN, the summation constraint is satisfied as 
an equality. Regarding the pair of constraints on each d m , 
either one of them can be an equality, or none. This partitions 
M into three subsets, one containing files of unit replication 
density (i.e., stored at every node) = {m : d m = 1}, one 
containing files stored in just one node A / J i = {m : d m — jj}, 
and the complementary M l — M \ (M^ U MJ. 

If the files are ordered, then the three subsets are ordered: 

Lemma 6 [Monotonicity of sets M v M v M l ]: Pro- 
vided that p m is non-increasing on m, the optimal solution 
for the CD problem takes the form of M.^ = {1, 2, — 1}, 
M % = {I, I + 1, . . . , r - 1}, and M l = {r, r + 2, . . . , M}, 
where I and r are integers with l<l<r<M + l. 

The uniqueness of the optimal solution [d™*] implies the 
uniqueness of the indices and the three partinitions. We, then, 
denote by l*,r*, and M*, M.*, M* their optimal values. 
Given these, the solution d™* takes the form of 



dT = { 



1, 

K-l* 
1 



1 - 



M-r +1 
N 



£3 



m e M* 
m £ Mt 



m £ M*. 



(11a) 
(lib) 

(11c) 



Fig. 1 illustrates such an example solution, depicting the 
density indices I* and r*, as well as sets M*, M* M* 
when file popularities follow the Zipf law (see Section VI-A). 
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M* 




M* 















Fig. 1 . An example case of density d m and the Ai* , M.* and M* partitions. 
Solid line plots the ~ m 3 law of m 6 M , when p m follows Zipf's law. 



C. Discrete Density Formulation 

Now, we restrict our attention to networks with a number 
of nodes equal to a power of 4, N = 4", and consider a 
constrained version of the CD Problem of discrete densities: 

Problem 5 [Discrete Density (DD)]: Minimize 
C w ([d m ]) = C CT> ([d m }) with respect to [d m ], subject to: 

1) For any m € M, d m = i~ Vm , with v m £ {0, 1, . . . , v). 

2) E m eMdm<K 

It is clear that the CD Problem is a relaxed version of the 
DD Problem, i.e., any [d m ] satisfying DD constraints, satisfies 
CD's constraints as well, hence C CD * < C DD *. 

DD, defined on a discrete set, is not easy to solve. Still, it is 
straightforward to construct an efficient solution d° m = Ar v ™ 
setting it to the largest power less or equal to [d™*] 

d° m 4 max {4-* : 4~ J < (Q*, ie{0,l i/}}. (12) 

Let, then, C DDo = C CDo be the required link rate of the DD 
and CD problems for the [d° n ] replication density. Although 
not optimal, [d^] is quite efficient a solution: 



Theorem 7 [Canonical Placement Efficiency]: 

6 ' 



C CD * < C CDo < 2C CD * + 



D. Replication Policy Design 

We present a replication algorithm that allocates the files M 
in the caches [B n ] given the replication densities d^. In the 
algorithm, we represent each node n by its coordinates (x, y) 
with x and y taking values in {0, 1, ... , y/N — 1}. First, we 
partition the data into the sets Mo,M\, . . . ,M V , such that 
A4i contains data m of d° m = 4~\ Then, each file m gets 
assigned to a unique node in the 2 V ™ x 2^™ submatrix. 

Indeed, we start (step 1) by assigning to S(o,o)> me nrst 1 x 1 
submatrix the elements of Mo, i- e -> the files to be replicated 
at all nodes of the network. Then, in the loop of step 2, we 
scan all sets Mk and allocate its elements to the buffers. 

First, we repeat the 2 k ~ 1 x 2 k ~ 1 first submatrix created so 
far to the 2 k x 2 k submatrix (loop of step 4), and then, we 
iteratively take out of set Mk files at decreasing popularity, 
filling one node in the 2 fe x 2 fe submatrix (step 8). The node 



where each data is placed is selected as a node with the least 
number of files so far and in case of tie, by scanning the 
diagonals (step 12). In this way, we spread files to balance the 
load over the links, a key property for the WN problem. 

Algorithm 1 [Cache Data Filling]: 



B 



(0,0) 



for fee {l,2,...,i/} do 
for x = 0,1,..., 2 fe - 1 do 
for y = 0, 1,. ..,2 fc - 1 do 

B(x, y +2 k - 1 ) &(x,y) {Replicate cache contents} 



B 



(x,v) 



B( x +2 k - 1 ,y+2 k - 1 ) B(x,y) 



'(x+2 k ~ 1 ,y) 

,y+2 k - 

while Mk is not empty do 

m <— argminp m . {Find the file m of highest p m } 



10: 
11: 

12: 



S <- 



meMk 

- Mk \ {m}. {Remove it from set Mk} 



\B< 



{Find the set of 



arg mm 

(z,y)6{0,l,...,2 fc -l} 2 

nodes (x, y) with the minimum number of elements} 
Select node (x,y) from S as the first node by 
scanning from top left to bottom right the main 
diagonal, then next diagonal, etc with wrap around 
in the 2 fe x 2 k submatrix. {See Fig. 2} 



Denote by B° n = B° x ^ the cache contents of node (x, y) 
at the end of the Algorithm 1. Clearly, in B° n each data m 
is replicated 4"™ = N /d° m times in the network, with replicas 
located every 2"™ nodes along the two axis (as in Fig. 3). 
Next, we study the validity and optimality of Algorithm 1. 

E. Validity and Optimality for AN and WN Problem 

Lemma 8 [Algorithm 1 cache size]: Algorithm 1 results 
in caches B n filled with at most K files. 

The above result shows that Algorithm 1 constructs a 
solution compatible to the WN's and AN's constraints. Let 
us specify a set of shortest paths [7\L° ] to use for delivery 
in conjunction to the caches [£>°] (see Fig. 6 in Appendix A): 

• In the case that there are multiple w m nodes at the same 
hop-count from node n, requests of n about m will be 
directed to the w m node at the north and west of node n. 

• If n and w m are on the same row or column of the 
network, we use the single I-shared shortest path. 

• If n and w m are on different rows and columns of the 
network, we use equally the two L-shaped shortest paths. 

The above routes are optimal for the AN problem, but 
not necessarily for WN. Define C ANo = C AN ([B°], [Jl° n J), 

r l y-M-j 2/3 jf K _ 
2^k=i+iPk i 11 A 

1, if K - 



A itj ^ I 



3- > 



J- = 



(13) 



Next results establish the order optimality of the constructed 
solution for the AN and WN problems: 

Theorem 9 [Algorithm 1 optimality on AN]: 
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1 (2 fc -l)2 fc +l (2 fc -2)2 fc +2 ••• 2-2 fc -l 2-2 fc 

2 (2 fc -2)2 fc +2 ••• 3-2 fc -l 3-2 fe 



2 fe +l 



2-2 fe +l 2 fc +2 3 ••• 4-2 fc -l 4-2 fc 



(2 fc -2)2 fc +l (2 fc -3)2 fc +2 (2 fc -4)2 fc +3 • 
(2 fc -l)2 fc +l (2 fc -2)2 fc +2 (2 fc -3)2 fc +3 ••• 2-2 fc -l 2 k 

Fig. 2. The order of precedence in the 2 k X 2 fe matrix (Algorithm 1, step 12). 



•Ill 


•W 


•111 


• |5| 


• HI 


• ^ 


• HI 


• HI 


• a 


•0 


• S 


•H 


• HI 


• HI 


• HI 


• HI 


• HI 


• 14 


•HI 


•0 


• HI 


• HI 


• HI 


• HI 


• a 


•|2| 


• |3| 


•H 


• HI 


• HI 


• HI 


• HI 


•111 


• ^ 


• HI 


• |5| 


• 111 


• 


• HI 


• HI 


• a 


•a 


• |3| 


•S 


• HI 


• HI 


• HI 


• HI 


•111 


• |4| 


• HI 


• Izl 


• HI 


• HI 


• HI 


• 


• |3| 


•a 


• |3| 


• HI 


• HI 


• HI 


• HI 


• HI 



Fig. 3. An example run of Algorithm 1 for Mi = {1,2,3}, M2 = 
{4,5}, M 3 = {6,7,8,9, A, B,C,D, ...}: Files {1,...,D} have been 
placed in the buffers; in subsequent steps, buffers will start receiving their 
second file. The boxes mark the 2 k X 2 k submatrices. 



Theorem 10 [Algorithm 1 optimality on WN]: The 
maximum link load C m ° is within a multiplicative constant 
to the optimal C WN * plus an additive term 2 + Aq<q/4, that 
depends on the distribution \p m ], and cache capacity K. 

VI. Asymptotic Laws for Zipf Popularity 

To study the scaling of link rate, we switch from the 
arbitrary popularity to the Zipf law, a distribution known to 
model well various types of network traffic. 

A. Zipf Law and Approximations 

The Zipf distribution is defined as follows: 

1 



where r is the power law parameter, indicating the rate of 
popularity decline as m increases, and H T (n) = Y^j=i3~ T 
is the truncated (at n) zeta function evaluated at r (also 
called the 71 th T-order generalized harmonic number). The 
limit H T = lim H T (n) is the Riemann zeta function, which 

converges when r > 1. We derive an approximation for H T (n) 
by bounding the sum: for n > m > 0, 

I (x + l)- T dx < H T (n) - H T (m) < 1 + / x~ T dx,=> 

Jm J m+1 



' Kl) ':^' )h < H T (n) - H T (m) < + if r^l, 



\n^<H T (n)-H T (m)<ln^, 



if T=l. 

(15) 

As we are interested in the asymptotic scaling of link rates, 
for notational simplicity, we remove the multiplicative factor 
from the objective function of the CD Problem, and refer to 
the resulting quantity as C. Substituting the solution (11) and 
plugging in the Zipf distribution into (16), it follows that 



M 

c - E - : ) = c i + c i - Y,p™> 



(16) 



where J2j=iPm = 0(1) (as it lies always in [0, 1]), and 



o t = E 



(14) 



H^(r-1)-H^(l-1) 



rneM, 



K. H T (M) 



^= E 



Pm (14) 



H T (M) - H T (r - 1) 
H T {M) 



A (A- - j + l)N - (M - r + 1) 



(17) 
(18) 
(19) 



Last, for simplicity, we drop the star notation and use d m , 
I and r to refer to the values of the optimal solution. 

B. Estimation of I and r 

As indices I, r are not provided in a closed form, we derive 
approximations in order to find the scaling of C. 

1) Estimation of I: first, I < K + 1 (hence I = 0(1)), as 
I — 1 represents the number of files cached in all nodes. If A4^ 
and M. are not empty, using (lib), di < 1 is equivalent to 

M-r+l 



K-l+1- 



N 



i%(r-l)-i%(Z-l) 



• (20) 



If, moreover, the first set M. is not empty, i.e., I > 1, then 
d t _i = 1. This means that if we attempted to decrease index 
I by 1, this would violate the density constraints, and result in 
(lib) a number greater than 1 for di-\: 

11 ' ' • (l-l)^\H^{r-l)-H^(l-2) 



K-l- 



N 



■ (21) 



Thus, provided Z > 1, it can be uniquely determined as 
the lowest integer that satifisfies (20)-(21), which is unique 
(Theorem 5). An approximation for I can be computed treating 
(20) as an approximate equality when M. % ^ 0, or equivalently 
when I < r (as = 1 and d ; < 1): 



K-l + l-*^*l* 



H^(r-l)-H^(l-l) 



(22) 



2) Estimation ofr: If M % \JM } is not empty, d r -\ > 

(K-l+l)N-M+r-l >(r-l)T ^^(r-Vj-H^l-l) . 

(23) 

Again, if set M. is not empty, i.e., r < M, then d r — N^ 1 . 
Thus, if we attempted increasing index r by one, (lib) would 
violate the constraint resulting in a density less than iV _1 : 



(K-l+l)N-M+r < rir \H 2, (r)~H 2,(1-1) 



(24) 
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As before, (23) is an approximate equality if I < r, i.e., 

(K-l+l)N-M+r-l = (r-l)T f/f 2t (r-l)-/^ (Z-l) 

(25) 

3) Estimation of l /r: For all Z, r, it is N > 3^ = (^r)^- 
As before, whenever I and r are not equal to the extremes, 
i.e., Kl<r<M + l,it holds that d l _ 1 /d r = N. Thus, 

I = rN~^ . (26) 

Next, we show that it is impossible to have files cached in all 
nodes unless the popularity parameter r is greater than | (the 
proofs of the results that follow can be found in Appendix B): 

Lemma 11: If t < §, then I ->■ 1. 

We proceed to the asymptotic behavior of the system on the 
rate C, and indices I and r, which govern partiting to M.^ , M % 
and M. The scaling behavior regards the case of the number 
of nodes N and the number of files M increasing to infinity. 
We use I and f to refer to the limits of Z and r. 

First, a set of basic results establish the upper bound of 
C = o(yN^j, the Gupta-Kumar rate [2]. This is intuitive: if 
replication is ineffective (e.g., due to large number of files), 
then the system and its performance essentially reduce to [2]. 



Lemma 12 [Bound on CA: C. 



0(sfN). 

- O (VN^J. Furthermore, 



Lemma 13 [Bounds on C ]: C 

1) for r < 1, and r "< M, it is C^ = (/n), 

2) for r > 1, it is C\ = 0(y/N (r^-M 1 ^)). 

lim 

If moreover, r < M, then C & = 



\/]yA 



Corollary 14 [Bound on C]: C = 0[yN 

Next, we start the asymptotic analysis, partitioning the space 
of M, N parameters according to whether they produce singly 
replicated files or not, i.e., non-empty or empty M. 

C. Almost Empty M. k 

The first case to consider is when the number of nodes N 
and files M increase towards infinity, and at the same time M. 
remains an almost empty set. We define formally M. ~ iff 
= o(M), i.e., the number of its elements is lower order 
than the total files. For this to happen, M should increase at a 
slow pace with N, so that the constraint d m > N~ x is satisfied 
for almost all (i.e., up to o(M)) files. The extreme case of this 
is to first let N — > oo, and then have M — > oo, i.e., split the 
limit of N, M jointly going to infinity to a double limit. 

To study the asymptotics of C, we first estimate I and r. The 
almost empty Ai i implies that M — r = o(M), thus r <~ M . 

Theorem 15 [[for Almost Empty M t ]: 

1) For t < 3/2, 

2) For t > 3 /2, / converges to the integer solution of 

{ fif-!+l)i-T < Hzl-HzlQ-I), 

< 3 3 (27) 

\{K-l){l-l)-^ > 

if such exists and is greater than 1, or 1 otherwise. 



Note that an approximate solution of (27) is obtained from 

K — (I — 1) = (I — 1) \H'2t -H^(l- 1) 



1 (15) I - 1 

= 3- 



2t-3 



2t - 3 



(28) 



Next, we study the conditions to have /A almost empty: 
Theorem 16 [M l Almost Empty]: M -r = o(M) iff 



for t < 3/ 2 , M < (1 - ^) KN, 
for t = 3/ 2 , M In M < KN, 



for t > 3/2, 



M < 



where I 



'{>>! M =0} fr om Theorem 15. If the above 
inequalities are strict, then r = M + 1 (and thus M. i =%). 

Theorem 17 [Capacity for Almost Empty MJ: 

. 7/r < 1, C = (VAtj. 

. 7/r=l, C = o\^M_). 

. // 1< r < 3/ 2 , C = (M 3 / 2_T ). 

. 7/r = 3/ 2 , C = 6>(tog5 M^. 

. 7/r > 3/2, C = 0(1). 

D. Non-empty M t 

When M is non-empty, it is C l > 0. As Corollary 14 shows, 
C is O (y/Ny i.e., the Gupta-Kumar rate [2]. Thus, we turn 

our attention to identifying the cases that C = o ^-\//v)- 

Theorem 18 [Zand f for Non-EmptyTWJ: If M exceeds 
the condition of Theorem 15, 

• if KN —M = uj(1), then we discern the following cases: 

r<3/ 2 : r ~ 3 ~ 2T (KN - M), (29) 



t =3/2 : ?—>•!, rln) 



2r 



KN - M, 



t >3/ 2 and M < (K — (3) N 



K + 1 - lim ■ 



M 
AT 



a 



KN^ - 



M 



t >3/ 2 and M > (K-0)N 
~2t 



1^1, r 



(KN - M) 



(30) 

(31) 
(32) 

(33) 



where a 



2t-3 



if KN - M 2 = 0(1), Then I -> l,r = 0(1), with the 
exact value determined by 



' KN-M+r-1 > (r-l)Tri?2z(r-l), 
KN-M+r < r^H^r). 



(34) 



Note that in the case of r > 3 /2, the asymptotic law for r 
is the same, r = {(KN - M) A) in both (32) and (33). 

Moreover, the approximation of (31) on Z can be precisely 
carried out via (27), if we substitute K with K — lim m /n. 
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TABLE III 

(a) The Cases of r < 1, r = 1 and 1 < r < 3/ 2 . 



M 


M finite 


iV— >oo, then 


M '< ^^KN 

— ZT 


lim 

M > ^^KN, 


M ~ 


KAT 


M — ► oo 


and M < TCiV 


KN-M = w(l) 


ifA r -M = 0(l) 




empty 


empty 


almost empty 


non-empty 


non-empty 


non-empty 




1 


1 


1 


1 


1 


1 


f 


M + 1 


M + l 


71 JT 1 71 

M — o(M) 


(AiV — M) 


3 — 2t / TV" AT 71 jf"\ 

+^-(KN - M) 


0(1) (34) 




r < 1 


0(1) 


©(\/m) 


©(Vm) 


©(\/m) 


©(Vm) 


©(\/m) 


C 


r = 1 


©(1) 


w ^ log M ) 


q( Vm \ 

w \ log m y 


^ V lo s M / 


©(Vm) 


©(Vm) 




1<t<§ 


0(1) 


©(M§- T ) 


©(M§- T ) 






©(\/m) 



(b) The Case of r = 3/2. 



M 


M finite 


N^oo, then 
M -s- oo 


MlnM< X AT 


M In M '> KN and M '< KN 


M ~ KAT 


KW-M = w(l) 


KN — M = 0(1) 




empty 


empty 


almost empty 


non-empty 


non-empty 


non-empty 


f 


1 


1 


1 


1 


1 


1 


f 


M + l 


M + l 


M - o(M) 


r In r ~ ATiV - M 


r In r ~ KN - M 


©(1) (34) 


C 


0(1) 


©(log I M) 


©(log? M) 


© (logs ry 1 




©(\/m) 



(c) The Case of r > 3/2. 



M 


M finite 


N^oo, then 
M -> oo 


M '< WV^ 
(see Th. 16) 


lim 3 

M > hN 2t ; and 

lim 

M < (K — P)N 


M> (K ~(3)N 
and M '< ATA^ 


M ~ ATAf 


ATAT-M=w(l) 


KN-M = 0(V) 




empty 


empty 


almost empty 


non-empty 


non-empty 


non-empty 


non-empty 


i 


©(1) (27) 


©(1) (27) 


©(1) (27) 


^c^+l-limf ] 


1 


1 


1 


f 


M + l 


M + l 


M - o(M) 


~a|i™T* — 

JV 2t _ 


~[^(iTA r -M)]* 


~ [^-(KN-M)] * 


©(1) (34) 


c 


0(1) 


©(1) 


©(1) 


©(1) 


©(1) 


e( — ^_r>) 

V(ifiV-M) 2t J 


0(VM) 



Theorem 19 [Capacity for M < KN, M & ^ 0]: 
. 7/r < 1, C = (9 (\/M). 
. 7/r=l, C = ©fj^). 

. // 1< T < 3/2, C = fM5- T ). 

. 7/r = 3/ 2 , C = 6>(log5r). 
. 7/r > 3/ 2 , C = 0(1). 

Theorem 20 [Capacity for M ~ KN]: 
. 7/r < 1, C = <9 (\/m). 

. / / l< T <3 /2jC = ( p _VM_ 

.7/r = 3 /2j c = ey 
. i/t > 3/2, c = e 



Problem, we consider the scaling of Aq^, from (13), 



3(t-1) 

(KN-M) 2x 



is. Validation for the WN Problem 

Recapitulating on the asymptotic laws, C, C CD *, and C M * 
are of the same order (Theorem 9). To complete with the WN 



M 



1 



Thus, A .o diverges only for r < 3/2, as M r_1 when r < 
3/2, or log M when r = 3/2. As easily verified from Table HI, 
A ,o always scales slower than C, therefore, Table III pertains 
to the scaling of the required rate for the worst link case, too. 

F. Discussion on Asymptotic Laws 

The main result of the asymptotic laws regards the minimum 
required link rate required to sustain a request rate of A = 1 
from each node. As a preliminary comment, the sustainable 
link rates are subject to the information theory and Shannon's 
capacity law. Thus, a rate C that scales to infinity should be 
interpreted rather as the inverse of the maximum sustainable 
request rate A, e.g., the result of C = 0(VM) for A = 1 is 
equivalent to C = 0(1) for A = 0( 1 /Vm), as in [2]. 

The power law parameter r sets two phase transition points, 
1 and 3 /2, leading to distinct asymptotics: the higher r, 
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the more uneven the popularity of files, and thus, the more 
advantage in caching (i.e. lower link rate C). As an example, 
on t > 3 /2 and M < (K — e)N, with e a small constant, 
C = 0(1), or, in words, the wireless network is a sustainable. 
Nevertheless, such high a t arises in particular scenarios (i.e., 
mobile applications), as discussed in Section II. 

More common are the cases of low and intermediate values 
of t (traffic in routers and proxies), which flatten the popularity 
distribution towards the uniform. Replication becomes less 
effective, ending up to the ^/ M^j law for r < 1, a synonym 
of the Gupta-Kumar law, if we associate the number of files M 
to the number of communicating pairs in [2]. When M scales 
slower than N, there is an improvement over [2], especially 
on t > 1, which, under the above prism, expresses the Gupta- 
Kumar law for the flows induced from the replication. 

In an alternate perspective, the joint scaling of M and N can 
be considered as each new node enriching the network with 
its 'own' files. In the case of M ~ SKN being a fraction of 
the total buffer capacity, the node has spare capacity to cache 
other files, too. The value of constant 6 determines the size of 
the set Ai (Theorem 16), and, consequently, the scaling law 

of C: 6>(Vm) for t < 1, g(m^ for 1 < r < §, or 0(1) 

for r > |. Under this perspective, the improvement for t > 1 
spotlights the advantage of replication: as M is of the same 
order with N, the flow model is a fair comparison to [2]. 

Last, when the ratio of M over KN approaches 1, hardly 
does it remain any capacity left for replication: rate C in the 
last two columns of the tables essentially match the Gupta- 
Kumar rate of o(y/M^j. 

VII. Conclusions & Future Work 

In this work, we investigated on the joint delivery and repli- 
cation problem of flat wireless networks with caching. After 
formulating the precise problem on square lattice networks, we 
gradually reduced it to a simple density-based problem; from 
its solution, we designed an efficient order-wise replication and 
delivery scheme and derived the scaling laws on the required 
link capacity. As our study reveals, the key parameters in 
network sustainability are the file popularity parameter r and 
the relative scaling of the numbers of files vs. network nodes. 

A future extension of this work will include a new dimen- 
sion of scaling with node capacity K; that is, in an expanding 
network, not only new nodes and content are added, but 
existing nodes evolve with augmented buffer storage. In such 
a setup, replication is expected to be advantageous order-wise, 
even for low values of r provided that cache capacity K scales 
sufficiently fast with the number of files M. 

Appendix A 

Proof of Lemma 3: Using Lemma 2, we use a single arbitrary 
shortest path for each n, m pair to a unique w m , n . From the 
discussion on d m , the set W m has cardinality W m = Nd m , 
i.e., there exist Nd m nodes in the network that serve the 
requests for file to. Let Q w ^ m be the set of nodes served from 
node w for requests on m, and Q w , m be its cardinality. Clearly, 



E Q 

w . r 



N. 



(35) 



w£W„ 



It is not hard to see that the best arrangement for a node w 
that serves a cluster of Q w . m nodes on requests for data to, is 
when these nodes lie in a square rhombus centered at w: this 
minimizes the total hop-count J2 n eQ m m M n > TO )- Thus, the 
1 st node (w itself) has a hop count h = 0, the next 4 nodes 
have h = 1, the next 8 nodes have h = 2, etc. as illustrated 
in Fig. 4. Therefore, taking into account the nodes of the last 
incomplete ring at hop count equal to p + 1, it is 

Hops for the nodes of the rhombus at hops <p 



y^h(n, to)>1x0 + 4x1 + 8x2h h 4p • p 

Number of nodes at p+1 X (p+1) 

+ \Q Wm -l-2(p+l)p]{p~+Tj 

= 2p(p+l) 2 ^+[Q Wtm -l-2(p+l)p}(p+l), 

where p is the radius of the rhombus. The radius can be 
computed as the integer part p = [p\, where p satisfies 

= 2(p+l)p+l. 

Indeed, the RHS expresses the number of elements in a 
rhombus of radius p, when p is an integer. Solving for p, 



P = 



-1 + y/^Qw,m ^ 1 



(36) 



Observe that 



lpip+1) 2 ^- < 2p{p+l) 2 ^+[Q Wim -l-2{p+l)p] (p+1). 

(37) 

Indeed, the above is an equality when p is integer, as the 
second term of the RHS vanishes. Moreover, it is easy to see 
that the LHS is a convex and increasing function of Q WtTn , 
(i.e., if we substitute p = p(Q w . m ) from (36)), whereas 
the RHS is piecewise linear on Q w ^ m (the pieces' endpoints 
correspond to integer values of p). Comparing the two, the 
LHS cannot exceed the RHS, as illustrated in Fig. 5. Thus, 



J2h(n,m)>2p(p+1) 



2p+l V2 



(Qru,m-1) \ Q 



1 



with the equality b eing tru e when p is integer. Moreover, since 

f(x) = (x — 1)\J x — \ - x(\/x — 1) > for x > 1 (as 
f(l) = 0, /'(l) > and f"(x) > for x > 0), it is 



E Hn,m) > -£-Qv 



Qw.r. 



(38) 



Then, the total number of hops for each to is expressed as 



\/2 



2£M 



w£W m neQu 



wEW„ 



> ^ 



3 



.w£W n 




Qw,m 
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O Node 

O Node w m 

• Node 

Set Q 
Nodes 1 hop away 

Nodes 2 hop away 
Nodes 3 hop away 



Nodes 4 hop away 
(last incomplete ring) 




Fig. 4. A cluster of peers served by node w m for data m of p = 3. 



Fig. 5. The RHS of (37)) (dashed line) 
and the LHS of (37) versus the cluster 
size Q w , m : knots indicate the points of 
p{Qw,m) = 1,2,..., where (37) is an 
equality. 



Fig. 6. The square cluster Q w of 
the nodes served by node w, with 
the I-shaped and L-shaped routes 
for nodes n and n', respectively. 



where the last inequality is Jensen's inequality applied on 
the convex function g(x) = x (y/x — 1) for the average node 
count per w, J2 W Qni,mW~ l . 
In total, 

As this inequality is true for the optimal values of the AT 
problem, the result follows. ■ 

Proof of Lemma 5: Consider density vectors \d m ] and [d' m ] 
both satisfying the constraints, and their convex combination 
d' m = Xd m + (1 — \)d' m for A G (0, 1). Function ^ is strictly 
convex, hence C CD ([<J < XC CD ([d m ]) + (1 - A)C CD ([<J), 
i.e., the objective C CD ([d m ]) is strictly convex. Moreover, the 
constraints iV _1 < d m < 1 are convex, and the constraint 
zCmeA-i dm — ^ * s a ffi ne - Thus, the optimization is strictly 
convex [20, §4.2.1], and has a unique optimal solution. ■ 

Proof of Lemma 6: Let us first prove that = 



{1,2, 



I — 1}. Suppose that in the optimal solution d^*, 
A / J T contains rri2 but not mi, with mi < m%. Due to the 
ordering, it is p mi > p m2 - We then construct a new solution 
d' m by exchanging the densities of mi and mi, i.e. d' m2 = c?™* 
and d' mi = d™*, and maintaining the densities of any other 
m: d' m = d™* for m e M \ {mi, m?}. 
The link rate of the two solutions differ by 



C CD *-C" 



'CD* 

mi 



(Pr, 



V2 
6 



{Pm\ Pm 2 ) 



- 1 



If p mi = p m .,, d' m has the same link rate C CD as the d%%*, 
therefore it is optimal, too. This means that we can replace 
d^* with d' m to construct a second optimal solution, which is 
a contradiction as the optimal solution is unique (Lemma 5). 

Consider now the case of p mi > p m ,- As mi ^ M. , it is 
dm* = d'm 2 < 1- Clearly, from the above equation, p mi > p m2 
implies that C CD * - C" CD > 0. Thus, d™* is not optimal, which 
is a contradiction to the hypothesis about mi and m.2. 

Similarly, for the M. part, a solution with m% gM , 7714 ^ 
A4 l and m 3 < leads to a contradiction. ■ 




Proof of Theorem 7: The first part has already been shown. 
For the second part, note that e?™* < 4d m . Thus, 



2C CD *= 



Proof of Lemma 8: Observe that on each iteration of k, 
Algorithm 1 adds data to a cache with the least items. Thus, for 
cache n to end up with K + l or more items, all other n' 7^ n 
should have at least K items, i.e., J^neAf \^ n \ — + ^ 

This is a contraddiction: each item m appears in Nd° m = 
A v ~ Um caches, and from d° n construction, it is 

]T \B n \ =Nj2d° m ( < N d "n < NK. M 

nGAC mGM mEM 

Proof of Theorem 9: Given the shortest path routing and 
the replication pattern, we can compute the expected number 
of hop counts from a random client to the node serving its 
request. Indeed, consider that a node in VV m that stores m in 
its cache, and let us compute the summation of hops to the 
client nodes in Q w , m it serves. Q Wt m is a set of nodes laying 
on a square of size 2 V ™ x 2"™, with w m at its center (Fig. 6). 
From symmetry, we can perform the summation by counting 
twice the required hops along one axis. For d° n < 1, 



Movements on the vertical direction 



columns 



neQ 



h{n,m) = 2x + . . . + 2<- 1 +l + . . . + {2<- 1 -l)]2< 



Given the Nd m nodes of W m , the average load across all 
links is computed by summing across m from (9): 



iEE ^-H {dli<1}Pn 



2N 



m£M w£W„ 



l - £ Nd m 2^'H {d0m<1}Pn 



< 



2N 

m£M 
1 1 



E 

men 



/2C C 
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Given that (Theorem 4) C CD * < C" AN * < C ANo , and (Theo- 
rem 7) C CDo < 2C CD * + V2/ 6 , it follows that 

Proof of Theorem 10: Consider a link £ of the network and 
a data to, and let us calculate the load Ce. m incurred on £ for 
requests on data to. Traffic regarding to flows through link £ 
only if I links nodes of the same cluster Q w . m . Next Lemma 
computes a bound on the load Ce, m - Note that Algorithm 1 
constructs a square-shaped cluster Q w . m of size 2 U ™ x 2 !/ ™. 

Lemma 21 [Link Load per Data]: 

• If link I is adjacent to nodes belonging to different 
clusters Q Wm or v° n = 0, then Ce. m = 0. 

• If I is in the same row or column with w m , then Ce >m < 

2"m-l (2"m"2 + \)p m . 

• If £ is not in the same row or column with w m , then 



C. 



< 2 V 



Proof: Fig. 6 depicts links £ and £', which constitute the 
worst case (w.r.t. carried traffic) for row links in the same and 
different rows to w m (column links can be treated similarly). 

Clearly, for link £, factor 2 U ™~ 1 counts the number of 
columns that lie from £ and outwards to the boundary of Q w , m , 
2 V ™~ 2 + | is the number of nodes in each of these columns, 
with 1/2 accounting for two-path routing for the nodes in 
different rows than £, and p m is the popularity of to. 

Similarly, for link £', 2 V ™~ 2 is the number 2 V ™~ X of nodes 
that are served through link £' multiplied by 1/2 due to two- 
path routing, and p m is the file popularity. ■ 

Using the above, we bound the load of an arbitrary link £, 
considering the case of I being a row link (column links can 
be treated similarly), located at row ye (ye takes values from 
1 to 2 V ) summing over all data to that have v° m > 0: 



m£M meM\M 



\-{jlw m m the row of £} 



+2"m- 1 (2"m- 1 + -)l 



< 



m£M\Mo 



'■{3tu m in the row of £} 



\ ~* 9 2i/° -2 -n 

A Pm 1L { : 3w m m the row of t} 



m£M\Mo 



As l/^fd^ = 2 V ™, the first summation is less or equal to 
2C ANo + 2. As for the second term, Lemma 23 that follows, 
bounds it by 2C ANo + 2 + A , /4. Using Theorem 9 as well, 

C WNo =max Q < 4C ANo +2+ -M < 2 + +6\/2C AN *. 
l ~ 4 ~ 4 

Given, moreover, that C AN * < C™*, the proof is complete. ■ 
Before establishing the bound of Lemma 23, we first make 
the following observation on the workings of Algorithm 1: 

LEMMA 22: For every row y, and at every step of Algo- 
rithm 1 's run, each replicated file that lies in the buffers of the 
nodes at row ye can be uniquely matched to another previously 
added replicated file in the buffers of the nodes at row y, 
excluding from the matchings the files of Mo and the replicas 
of the first file from the set M \ Mo added to row ye- 



Proof: Observe that the Algorithm operates on the 2 fe x 2 fe 
submatrix filling in data to the diagonals from set Mk- When 
Mk gets exhausted, the algorithm moves to Mk+i operating 
on the 2 fe+1 x 2 fe+1 formed by repeating the 2 fe x 2 k submatrix. 
Thus, if we exclude the files of Mo, duplicated to all nodes, 
and the first time we visit row ye to add a file 2 (this will regard 
node {ye, ye) of the first diagonal), we know for sure that by 
the time we revisit row y t (at the node of the second diagonal), 
we will have already visited all other rows. Thus, the file 
added once to the first 2 k x 2 k submatrix can be matched 
uniquely to another element previously put in the buffer of 
row y. Similarly, on the third and so forth visit of row ye to 
add a new file to the first 2 k x 2 k submatrix, we will have 
already have revisited row y to add another file in the the first 
2 k x 2 k submatrix, and thus we can match these two files. 

The above is true for the whole 2 k x 2 k network, too. When 
Mk gets exhausted, we replicate the 2 k x 2 k submatrix to the 
2 k+1 x 2 k+1 submatrix (step 4); in this process, the matchings 
in the 2 k x 2 k submatrix between the replicas of files in the 
buffers of row ye to the ones in row yi are replicated to the 3 
new 2 k x 2 k submatrices of the large 2 k+1 x 2 fe+1 submatrix. 
Thus, we established the matching of each replicated file in 
row ye excluding the files of Mq and the replicas of the first 
file from M \ Mo to the replicated files in row y. ■ 

LEMMA 23: For any row y = 1,2,..., 2", 

4 aye C;H — > ^ ' 2 m Pm^-{m exists in raw y}- 
m£M\Mo 

Proof: Continuing Lemma 22, let rhj be the file added in 
row ye with j denoting their order, excluding set Mo- Each 
one appears 4^ times in the network, and 2 V U ™J times 
in row ye. Let us index these 2 U v ™i replicas of rhj in row 
yt for j > 2, by r = 1, 2, . . . , 2 V Vfh i . In the matching of 
Lemma 22, remember that we excluded the replicas of the 
first data added out of M \ Mo, that is, the replicas of rh\. 

Let, moreover, TOj. y r be the matching of the r replica of 
rhj at row ye to the (replicated) data at row y. As already 
discussed, rhj, y , r has been added to the network before rhj. 
This observation creates the following cases: 

Case 1: Both rhj and rhj.y.r belong to set Ml~. Then, 



1: 



AM v Md%h* 



Pr, 



(12) 
> 



AA 



M ,M 



Pm 



m,j , 



and similarly, 1 = p^ Prf, iiSiP = 

3 
2 

Prhi v r _ 



AM^M,^*^, 



T 1 



A My Mfin 



Prhj iy , r ■ 



Combining these two inequalities, and using d° 



,3i/2, 



Pn 



< 2 m 3.V,rp^ 



(39) 



Case 2: rhj e M t , rhj y r € -Mj- Clearly, j/^ — v, whereas 
v °m 3 G {1,2,..., i^}. The fact that rh j y r <G M t imposes a 
constraint on the probabilities p^ vs. Pm jiVir of (lib): 



Pm jl9 , r > 8 m ^y- 



"Prhi = 2 '■"• r Pm i , 



2 To be precise, this will happen when we first visit row ye mod 2 k — the 
contents of this row will get repeated to row ye when k becomes 2 fc > ye- 
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i.e., if y r = v, or equivalently A -1 < d$?* r < 47V" 1 , 
the above reads as p TO . r > p,^ . as expected. However, if, for 
example v°~ = v — 1, or equivalently 47V" 1 < cfj?* < 

16A" 1 , it has to be p m . y r > 

8pm, so that (lib) is valid. 
Hence, (39) holds in this case, too. 

Case 3: Both mj and mj. y r belong to A / J r Then, z/^. = 
i/^.. Then, given that p™ 3 s r > Pa,-, (39) is valid, too. 
Summing (39) for all y, j > 2 and r = 1, 2, . . . , 2" , 

2 , E E 2**,-^ < E E E 2^r^ j y r * 

j>2r>l J/ j>2r>l 

^E^-V^EEE^*^ (40) 



For r > 1, it is C 4 ( = ©(//V [r 1 - r -M 1 - T ]). 

If r< M, too, then M 1 " T <r 1 " T , hence = o(0r 



Proof of Theorem 15: Case r < §: From Lemma 11, Z — >• 1. 
Case r > §: Examining what happens in (20)-(21) in the 
limit, we observe that r — > oo, hence H^ir — 1) — > i?2r. 



Assuming a limit Z — > Z 



^ 0> , (20)-(21) lead to (27); the 



i>2 



» 3>2r>l 



where in the last step we used the number 2 *i of replicas 
of each rhj on row yg on the LHS to eliminate the inner sum. 

We do the same for the RHS: the triple summation en- 
compasses almost all the replicas placed in the network. The 
'almost' regards the replicas added in the network after the 
addition of the last file rhj in row y t which do not participate 
to the matching. Thus, we can increase the RHS to encompass 
all replicas in the network. As each file m e M\ Mo has 
& v ~"m replicas in the network, the RHS is upper bounded by 



{t>§ m~®} 

latter, as in Lemma 5, have a unique solution. ■ 

Proof of Theorem 16: By the definition of M. almost empty, 
it is M — r + 1 = o(M), and given the constraint of (1), it is 
M = 0{N), thus M — r + 1 = o(A) = o((A - I + l)N), as 
A — I + 1 > 1 in all cases from Theorem 15. 

From the last element of M^ we have that <i r _i > A -1 . 
Substituting eZ r _i in the latter from (lib), and taking the limit 



{K-l+l)N > (r-l)ir H^(r-1)-H^(l-1) 



(41) 



]T4— ~2^ m 2 ^ 



Using the above bounds on the RHS with the LHS of (40), 

4 * 2 2 ^- 3 Prh] < A (C ANo + 1) 



J">2 

E 2 ' 



Last, consider the term of fh\ £ M \ Mo in the LHS: 



where we used M — r + 1 = o((A — I + 1) A) to eliminate the 
respective term from the LHS. Next, we use (15) to approxi- 
mate the Riemann terms and substitute Z from Theorem 15: 
Case < t < 3 /x I -> 1, thus (41) becomes 

2 3 3 

As 2r/3 < 1, it is (r— l)~f = o(r— 1). Hence, the above 
is equivalent in the limit to (r — 1) < A (l — 2 f) N, or, as 

r = 0(M), M < A (l - ^) A. 

Case r = 3/2: Z -5- 1, thus A A > (r-1) [ln(r-l) - InZ] . 

lim 

Using that InZ = o(ln(r-l)), we get (r-1) ln(r-l) < AA; 

lim 

asr = 0(M), the condition becomes MmM < AA. 
Case r > 3 /2: 

(A-Z'+1)A> (r-l)T- 



'~3~ (r-1) T- (r-1) 

2r_ 1 



z 1 Prhi — ~]CD* — - 

"mi u mi n? 



Pmx = AM T ,M 1 pi 1 <^o,o, As 2r / 3 > L k follows that (r-1) = o((r-l) 2 3 T ), and the 



where the first inequality is equality or strict inequality de- 
pending on whether fh\ € M l or rh\ G respectively. 
Combining the last two inequalities, the result follows. ■ 



above becomes (r— 1) < 



(K-i+l)(^-l) 



A 



. Substituting 



Appendix B 



lim 



Proof of Lemma 11: If we assume that in the limit Z > 1, 
then we have two cases for r: r — >• 00, or r = 0(1). 

If r — > 00, r < 3/2 leads to i?2r (r — 1) diverging to infinity 
in (22). However, the rest of the terms in (22) are bounded (as 
I < A). Therefore, (22) is a contradiction. Thus, it has to be 
either M % — 0, or Z = 1. As r — > 00 and Z < A + 1, it cannot 
be M t = 0. Therefore, if r ->• 00, it is Z = 1 (i.e., di ^ 1). 

lim 

If r = 0(1), (26) contradicts with Z > 1. Thus, I -)• 1. ■ 

Proof of Lemma 12: Follows from the summation definition 
of (17): observe that d m > A -1 and S me .M Pm < L ■ 



Proof of Lemma 13: For r < 1, C k = ( VN J follows 



r — 1 with M, the condition follows. 

Last, observe that in the above derivations, if we started 
with cZm > A -1 , we would find the conditions for M l being 
strictly empty, i.e. r = M + 1. As easily seen, this is true if 
the conditions are satisfied with strict inequality. ■ 

Proof of Theorem 17: To find C, we compute ? and ; 

from (17)-(18), and show that in all cases C i = o(c^J . Thus, 

C = 0^0^. In assessing C^r^M helps in deriving that 

M 

H T (M) -H T (r-l) = = 0(M" T (M - r)) . 

j=r 

Theorem 15 and M—r = o(M) result in K, = 0(1) for all r. 

Case t < 1: Regarding C s , Hzr(r + 1) and H T (M) diverge, 
while H-2r (Z — 1) is bounded (as Z < A + 1). Thus, 



(18) and the fact that H T (M) - H T (r) ( = } (H T (M)). The 
latter comes from H T (M) diverging and r < M. 



c, = e 



[m 1 - 



M 1 - 



If the condition of Theorem 16 is a strict inequality, C = 0. 
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Otherwise, it is an equality, with M = 0(A), and thus, 



c, 



N- 



'I v " H T (M) ~ >^^Viv M1 _ 

Case t = 1: In C f , H^(M) and H r (M) diverge, while 

(mS-1) " 



i?ar(Z-l) is bounded. Thus, C„ = 



log M 



= 6» 



log M 



As before, if the condition of Theorem 16 is a strict inequality, 
C l = 0. Otherwise, it is an equality, with M = 0(A), thus, 



a = Vn 







JV log M J ~~ 1 log M 



"1 • - ' H T (M) 

Case 1 < t < 3 /2: Regarding C„ only H2 2L (M) diverges, 
while the rest of the terms converge. Then, the order of C. is 



determined from Hi^ (M) 



M 1_ TT -1 


2 = 6>(Mi- r ) 


1 3 





As before, if the condition of Theorem 16 is a strict inequality, 
C — 0. Otherwise, it is an equality, with M = 0(A), thus, 



C, 



yM - 

JV H T (M) 



'(15), 



GWNM 



^ = o(l). In total, C = 0(1). 

M 3 ' 



Case t — 3 /2: C s = 0^1og 2 due to the numerator, all 
other terms are bounded. If the condition of Theorem 16 is 
a strict inequality, C & = 0. Otherwise, it is an equality, with 

M = 0(N), thus, C l = ©(Va^) = &( M ^ L ) = o(l). 
In total, C = (log? M y 

Case t > 3/2: All terms converge in (17), thus C, = 0(1). 
If the condition of Theorem 16 is a strict inequality, C & = 0. 

Otherwise, it is an equality, with M = (Nif^, thus C l = 

g(Vn^) =e( M - 

Proof of Theorem 18: In the second case of A A — M = 
0(1), observe that KN—M is the remaining number of places 
after putting the M files once in the network. Clearly, r < 
KN - M, thus r = 0(1). As both r and I are bounded, (26) 
cannot be true, therefore 1 = 1. Hence, r is estimated from 
(23), (24), which, substituting I -> 1 yields (34). 

For the first part where KN — M = uj(1), we first note that 
r = O(N), as r < M, and M = 0(N), due to (1). 
Case t < |: From Lemma 11, Z — > I = 1. Using this along 
with (15) in (25), we can estimate r: 

KN-M+r-1 = 3(r-l) 5 

3 — 2t 

Observe that assuming r = 0(1), the above results become 
a contradiction, as NK — M = ui(l), whereas all the other 
terms are 0(1). Therefore, it is r = u>(l), and (29) follows. 
Case r = |: From Lemma 11, I — > I = 1. Working as before, 
(25) in view of (15) gives that NK-M + r-l = (r-1) lnr. 
Clearly, r = lu(1), thus, r lnr — KN - M. 
Case r > §: First, we assume Z > 1. Using (22) and (15), 

1, T -i ll 2t " 1 27 " I > 27 " 1 2T 

M-r+1 _ ^ l 1 --s--r 1 ~~3 _ Z - iTrW 
' N 



1 



A-Z + l- 



A-Z+l- 



M 

' A' 



Z 



A 1 



1 

-=3Z 



3 1 
1 



3 1 



2r- 3 



where in the last step we used (26) to substitute r = IN ^ 
For N 1 - 



00, it is A 1 
1^ 2t - 3 



2r 



00, and the above becomes 

AT 



A + 1 - lim 



A 



Thus, the assumption of I > 1 is correct if K, lim m /n 
and r are such that the second factor of RHS approximately 

exceeds 1, i.e., M < (k - 2^3) N - 



Then, from (26), 



2r 



2r 



(A^+ 1)A^ 



M 



A 1 



Otherwise, I = 1, and r is computed from (25) using (15) 



NK-M+r-1 = 3(r-l) 



iT 1 



As A — > 00, it follows that r 



2r-3 



Proof of Theorem 19: First note that from Theorem 18, for 

all r, it is K % = O ( kn-m+t-i ^ = ^ (using M '™ 

In the cases of r < §, A4 4 ^ entails M > AT (l - ^) A 
(Theorem 16). It is also M < ATA, thus M = 0(A). 

lim 

Furthermore, from Theorem 18 and M < KN, it is 

3-2r. 

r ~ — - — 
2r 

3-2r 2r 



-(KN — M) + 1 M = KN 0(A), and, moreover, 



r < 



2t 



-KN = 11- — 
3 V 3 



AA < M. 



(42) 



Then, we compute the link rate as follows: 
Case r < 1: Using Lemma 13, it is C i = & (v'a) • Invoking 

Lemma 12, too, we get that C = ©(-/a) = 0^\/m). 
case r - 1. o 4 -v^v VJV ra - ^^i^mJ , 

0(A) = 0(M). Similarly, as Z 1, C ? = 

= e(^). In total, C = ©(^). 




Case 1< r < 3/2: Using r = 0(A) = 0(M), 



C, 



H T (M)-H T (r) (is, 

H T (M) ~ r r-l 



1- 



which is C i = O |mI" t J from (42). Last, I ->■ 1 implies 



that 



Hl T (r-\) 







In total, C = Mi 



^ H T (M) 

Case r = 3/2: Now, it has to be M ln M > KN, which also 
implies that MlogM = f2(N). From Theorem 18, we have 
that rlnr <~ AA — M. This means that rlogr = 0(A) in 

view of M < KN, and thus r = o(N). 

Moreover, comparing MlnM and rlnr in the above for- 

lim 

mulas, it has to be r < M. The latter implies that there exists 
a < k < 1 such that eventually jj < k. Using then (18), 



C l ( = 0( .V" 



1 

1~ 

T 2 



1 

Mi 



(m) 3 <^ 
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= 



N 



KN-M 



logr 



N K — M=0(N) 



Moreover, as I -> 1, C, = 







((logr) 1 



Thus, in total = ((logr)'). 
Case r > 3/ 2: it is r = O iN^ ) due to M < KN. 
Moreover, for M l ^ 0, it has to be M = J? (n£^J . Then, 

C x = VN ^M-, 1 !*- 1 ) «®> O (Nir*-A 
1 H T (M) V / 

= o (iVi + *(i--)) = o (iV*- 1 ) = 0(1). 

Last, C s = 0(1) (all terms converge). Thus, C = 0(1). ■ 

Proof of Theorem 20: In all the cases, we know that r < 
KN - M + 1, as KN - M is the number of spaces left 
for duplicate copies after all M files are stored once. Hence, 
r = 0(KN — M) — o(N) = o(M). Moreover, as before, in 
all cases, K % = ( ^-M+ r -i j = Q ^ _ 

Case t < 1: From Lemma 13, r = u>(M) implies that C = 
(V^y Hence, invoking Lemma 12, M= ^ (JV) 

For the rest of the cases with r > 1, it is r = o(M), 
therefore, from Lemma 13, we get that C' t = o(^0r^j . 
Case 1 < r < 3/2: Using r = O(KN-M) from Theorem 18, 
C i = ( (atat^m)^-i )- ° n the other hand ' ^ L and thus 



ffL(r-l) 

0? = -J = 



N 



tT 2 



M=e(N) / 
In total, = ( 



(KN-M) 

Case r = 3 /2: From the above, = 



"l - o ( ^ \ 

KN-M' ' ) ~ ^ \(KN-M)t-i)- 
) • 



Moreover, 



= H rl {r -'\ = (J 

fKH i (M) VV 



However, 



log r 
r log r 



q ( logr \ _ / l°g 3 r ^ 
^ ^ KN-M J u \ KN-M J ' 



thus 



C, = o(0 s ). In total, M ^ (W) ( v /3Tlog5 r). ^ 
Case t > 3/2: From Theorem 18, r = ((KN — M)^ = 

o(^7V - M) = o(M). Thus, C 4 = 

Moreover, 



Uv, 



i 



= O 



AT 



KN-M 



VN 
(KN-M)i ~i 

(the H-terms 



converge). As 3( -^ r 1 - > > |, it is 



M=0{N) 



^(\/ KN—M ) ' 
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